NSF PAR Search | NSF Public Access Repository


RLang: A Declarative Language for Describing Partial World Knowledge to Reinforcement Learning Agents

Rodriguez-Sanchez, Rafael; Spiegel, Benjamin; Wang, Jennifer; Patel, Roma; Tellex, Stefanie; Konidaris, George (July 2023, Proceedings of the 40th International Conference on Machine Learning)

We introduce RLang, a domain-specific language (DSL) for communicating domain knowledge to an RL agent. Unlike existing RL DSLs that ground to single elements of a decision-making formalism (e.g., the reward function or policy), RLang can specify information about every element of a Markov decision process. We define precise syntax and grounding semantics for RLang, and provide a parser that grounds RLang programs to an algorithm-agnostic partial world model and policy that can be exploited by an RL agent. We provide a series of example RLang programs demonstrating how different RL methods can exploit the resulting knowledge, encompassing model-free and model-based tabular algorithms, policy gradient and value-based methods, hierarchical approaches, and deep methods.
Full Text Available
Skill Generalization with Verbs

Ma, Rachel; Lam, Lyndon; Spiegel, Benjamin; Ganeshan, Aditya; Patel, Roma; Abbatematteo, Ben; Paulius, David; Tellex, Stefanie; Konidaris, George (October 2023, Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems)

It is imperative that robots can understand natural language commands issued by humans. Such commands typically contain verbs that signify what action should be performed on a given object and that are applicable to many objects. We propose a method for generalizing manipulation skills to novel objects using verbs. Our method learns a probabilistic classifier that determines whether a given object trajectory can be described by a specific verb. We show that this classifier accurately generalizes to novel object categories with an average accuracy of 76.69% across 13 object categories and 14 verbs. We then perform policy search over the object kinematics to find an object trajectory that maximizes classifier prediction for a given verb. Our method allows a robot to generate a trajectory for a novel object based on a verb, which can then be used as input to a motion planner. We show that our model can generate trajectories that are usable for executing five verb commands applied to novel instances of two different object categories on a real robot.
Full Text Available
Was it “stated” or was it “claimed”?: How linguistic bias affects generative language models

https://doi.org/10.18653/v1/2021.emnlp-main.790

Patel, Roma; Pavlick, Ellie (November 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
